AITopics | research interest

Collaborating Authors

research interest

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Urban 3D Change Detection Using LiDAR Sensor for HD Map Maintenance and Smart Mobility

Albagami, Hezam, Wang, Haitian, Wang, Xinyu, Ibrahim, Muhammad, Malakan, Zainy M., Alqamdi, Abdullah M., Alghamdi, Mohammed H., Mian, Ajmal

arXiv.org Artificial IntelligenceOct-27-2025

High-definition 3D city maps underpin smart transportation, digital twins, and autonomous driving, where object level change detection across bi temporal LiDAR enables HD map maintenance, construction monitoring, and reliable localization. Classical DSM differencing and image based methods are sensitive to small vertical bias, ground slope, and viewpoint mismatch and yield cellwise outputs without object identity. Point based neural models and voxel encodings demand large memory, assume near perfect pre alignment, degrade thin structures, and seldom enforce class consistent association, which leaves split or merge cases unresolved and ignores uncertainty. We propose an object centric, uncertainty aware pipeline for city scale LiDAR that aligns epochs with multi resolution NDT followed by point to plane ICP, normalizes height, and derives a per location level of detection from registration covariance and surface roughness to calibrate decisions and suppress spurious changes. Geometry only proxies seed cross epoch associations that are refined by semantic and instance segmentation and a class constrained bipartite assignment with augmented dummies to handle splits and merges while preserving per class counts. Tiled processing bounds memory without eroding narrow ground changes, and instance level decisions combine 3D overlap, normal direction displacement, and height and volume differences with a histogram distance, all gated by the local level of detection to remain stable under partial overlap and sampling variation. On 15 representative Subiaco blocks the method attains 95.2% accuracy, 90.4% mF1, and 82.6% mIoU, exceeding Triplet KPConv by 0.2 percentage points in accuracy, 0.2 in mF1, and 0.8 in mIoU, with the largest gain on Decreased where IoU reaches 74.8% and improves by 7.6 points.

detection, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2510.21112

Country:

North America > United States (1.00)
Oceania > Australia (0.96)
Asia > Middle East > Saudi Arabia (0.16)

Genre: Research Report (0.50)

Industry:

Government > Regional Government (1.00)
Information Technology (0.88)
Transportation > Ground > Road (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.48)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)
(2 more...)

Add feedback

Quantum Computing for Large-scale Network Optimization: Opportunities and Challenges

Macaluso, Sebastian, Geraci, Giovanni, Combarro, Elías F., Abadal, Sergi, Arapakis, Ioannis, Vallecorsa, Sofia, Alarcón, Eduard

arXiv.org Artificial IntelligenceSep-10-2025

Abstract--The complexity of large-scale 6G-and-beyond networks demands innovative approaches for multi-objective optimization over vast search spaces, a task often intractable. Quantum computing (QC) emerges as a promising technology for efficient large-scale optimization. We present our vision of leveraging QC to tackle key classes of problems in future mobile networks. By analyzing and identifying common features, particularly their graph-centric representation, we propose a unified strategy involving QC algorithms. Specifically, we outline a methodology for optimization using quantum annealing as well as quantum reinforcement learning. Additionally, we discuss the main challenges that QC algorithms and hardware must overcome to effectively optimize future networks. Quantum computing (QC) has rapidly emerged as a promising field, with its unparalleled potential to tackle problems typically intractable for classical computers. Quantum bits (qubits) leverage the principles of superposition, interference and entanglement to accelerate computations and open the door to previously unimaginable algorithms. This fundamental characteristic allows quantum computers to perform complex calculations at speeds exponentially faster than their classical counterparts in certain domains, enabling breakthroughs in fields such as cryptography, materials science, and artificial intelligence (AI). Developments in QC pave the way for novel solutions to intractable optimization problems and are expected to play a disruptive role in multiple industries.

machine learning, optimization, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2509.07773

Country: Europe (0.28)

Genre:

Research Report > Promising Solution (0.54)
Overview > Innovation (0.34)

Industry:

Telecommunications (1.00)
Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)

Add feedback

CSGO: Generalized Optimization for Cold Start in Wireless Collaborative Edge LLM Systems

Liu, Xuran, Xue, Nan, Bao, Rui, Sun, Yaping, Chen, Zhiyong, Tao, Meixia, Xu, Xiaodong, Cui, Shuguang

arXiv.org Artificial IntelligenceAug-18-2025

While deploying large language models on edge devices promises low-latency and privacy-preserving AI services, it is hindered by limited device resources. Although pipeline parallelism facilitates distributed inference, existing approaches often ignore the cold-start latency caused by on-demand model loading. In this paper, we propose a latency-aware scheduling framework that overlaps model loading with computation and communication to minimize total inference latency. Based on device and model parameters, the framework dynamically adjusts layer partitioning and allocation to effectively hide loading time, thereby eliminating as many idle periods as possible. We formulate the problem as a Mixed-Integer Non-Linear Program and design an efficient dynamic programming algorithm to optimize model partitioning and device assignment. Experimental results show that the proposed method significantly reduces cold-start latency compared to baseline strategies.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2508.11287

Country: Asia > China (1.00)

Genre: Research Report > New Finding (0.34)

Industry:

Information Technology (0.68)
Telecommunications (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)

Add feedback

Physiological Signal-Driven QoE Optimization for Wireless Virtual Reality Transmission

Wu, Chang, Chen, Yuang, Chen, Yiyuan, Guo, Fengqian, Qin, Xiaowei, Lu, Hancheng

arXiv.org Artificial IntelligenceAug-14-2025

Abrupt resolution changes in virtual reality (VR) streaming can significantly impair the quality-of-experience (QoE) of users, particularly during transitions from high to low resolutions. Existing QoE models and transmission schemes inadequately address the perceptual impact of these shifts. To bridge this gap, this article proposes, for the first time, an innovative physiological signal-driven QoE modeling and optimization framework that fully leverages users' electroencephalogram (EEG), electrocardiogram (ECG), and skin activity signals. Integrated the proposed QoE framework into the radio access network (RAN) via a deep reinforcement learning (DRL) framework, adaptive transmission strategies have been implemented to allocate radio resources dynamically, which mitigates short-term channel fluctuations and adjusts frame resolution in response to channel variations caused by user mobility. By prioritizing long-term resolution while minimizing abrupt transitions, the proposed solution achieves an 88.7% improvement in resolution and an 81.0% Experimental results demonstrate the effectiveness of this physiological signal-driven strategy, underscoring the promise of edge AI in immersive media services. While this technology enables unprecedented engagement in applications ranging from event viewing to interactive education, its reliance on wireless transmission poses critical challenges. The uncompressed data rates exceeding 1 Gbps and latency thresholds below 20 ms impose stringent demands on network infrastructure, particularly in mobile scenarios where channel fluctuations and user mobility degrade service consistency. Traditional quality of service (QoS) metrics (e.g., bandwidth, jitter, and packet loss) provide necessary but insufficient insights into user satisfaction, necessitating perceptual quality of experience (QoE) frameworks tailored to user's unique requirements [3]-[5].

artificial intelligence, machine learning, resolution, (18 more...)

arXiv.org Artificial Intelligence

2508.09151

Country: Asia > China (0.14)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.48)
Health & Medicine > Diagnostic Medicine (0.48)
Telecommunications > Networks (0.34)
Education > Educational Setting (0.34)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

FedHiP: Heterogeneity-Invariant Personalized Federated Learning Through Closed-Form Solutions

Tang, Jianheng, Yang, Zhirui, Wang, Jingchao, Fan, Kejia, Xu, Jinfeng, Zhuang, Huiping, Liu, Anfeng, Song, Houbing Herbert, Wang, Leye, Liu, Yunhuai

arXiv.org Artificial IntelligenceAug-7-2025

Lately, Personalized Federated Learning (PFL) has emerged as a prevalent paradigm to deliver personalized models by collaboratively training while simultaneously adapting to each client's local applications. Existing PFL methods typically face a significant challenge due to the ubiquitous data heterogeneity (i.e., non-IID data) across clients, which severely hinders convergence and degrades performance. We identify that the root issue lies in the long-standing reliance on gradient-based updates, which are inherently sensitive to non-IID data. To fundamentally address this issue and bridge the research gap, in this paper, we propose a Heterogeneity-invariant Personalized Federated learning scheme, named FedHiP, through analytical (i.e., closed-form) solutions to avoid gradient-based updates. Specifically, we exploit the trend of self-supervised pre-training, leveraging a foundation model as a frozen backbone for gradient-free feature extraction. Following the feature extractor, we further develop an analytic classifier for gradient-free training. To support both collective generalization and individual personalization, our FedHiP scheme incorporates three phases: analytic local training, analytic global aggregation, and analytic local personalization. The closed-form solutions of our FedHiP scheme enable its ideal property of heterogeneity invariance, meaning that each personalized model remains identical regardless of how non-IID the data are distributed across all other clients. Extensive experiments on benchmark datasets validate the superiority of our FedHiP scheme, outperforming the state-of-the-art baselines by at least 5.79%-20.97% in accuracy.

artificial intelligence, fedhip scheme, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2508.0447

Country:

Asia (0.69)
North America > United States > Maryland (0.46)

Genre: Research Report (0.50)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Physics-Embedded Neural ODEs for Sim2Real Edge Digital Twins of Hybrid Power Electronics Systems

Zheng, Jialin, Wang, Haoyu, Zeng, Yangbin, Mou, Di, Zhang, Xin, Li, Hong, Vazquez, Sergio, Franquelo, Leopoldo G.

arXiv.org Artificial IntelligenceAug-6-2025

--Edge Digital Twins (EDTs) are crucial for monitoring and control of Power Electronics Systems (PES). However, existing modeling approaches struggle to consistently capture continuously evolving hybrid dynamics that are inherent in PES, degrading Sim-to-Real generalization on resource-constrained edge devices. To address these challenges, this paper proposes a Physics-Embedded Neural ODEs (PENODE) that (i) embeds the hybrid operating mechanism as an event automaton to explicitly govern discrete switching and (ii) injects known governing ODE components directly into the neural parameterization of unmodeled dynamics. This unified design yields a differentiable end-to-end trainable architecture that preserves physical interpretability while reducing redundancy, and it supports a cloud-to-edge toolchain for efficient FPGA deployment. Experimental results demonstrate that PENODE achieves significantly higher accuracy in benchmarks in white-box, gray-box, and black-box scenarios, with a 75% reduction in neuron count, validating that the proposed PENODE maintains physical interpretability, efficient edge deployment, and real-time control enhancement. OWER electronics systems (PES) are the fundamental to drive efficient energy conversion [1] but require precise and real-time monitoring and predictive analysis due to the ultra-high standards for reliability and performance [2]. Digital Twin (DT), a high-fidelity virtual counterpart of a physical asset, presents a promising solution [3]. However, it is difficult to implement cloud-or-server-based DTs with high communication latency and limited bandwidth in PES because the PES dynamics differ significantly from power grids [4].

converter, machine learning, real time system, (18 more...)

arXiv.org Artificial Intelligence

2508.02887

Country:

Asia > China (0.95)
Europe (0.68)

Genre: Research Report > New Finding (0.66)

Industry: Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.93)
Information Technology > Architecture > Real Time Systems (0.91)

Add feedback

A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method

Xin, Zhanhua, Wang, Zhihao, Zhang, Shenghao, Chi, Wanchao, Meng, Yan, Kong, Shihan, Xiong, Yan, Zhang, Chong, Liu, Yuzhen, Yu, Junzhi

arXiv.org Artificial IntelligenceJun-17-2025

Abstract--In the field of multi-sensor fusion for simultaneous localization and mapping (SLAM), monocular cameras and IMU s are widely used to build simple and effective visual-inerti al systems. However, limited research has explored the integr ation of motor-encoder devices to enhance SLAM performance. By incorporating such devices, it is possible to significantly improve active capability and field of view (FOV) with minimal additi onal cost and structural complexity. This paper proposes a novel visual-inertial-encoder tightly coupled odometry (VIEO) based on a ViDAR (Video Detection and Ranging) device. A ViDAR calibration method is introduced to ensure accurate initia lization for VIEO. In addition, a platform motion decoupled active SLAM method based on deep reinforcement learning (DRL) is proposed. Experimental data demonstrate that the proposed Vi-DAR and the VIEO algorithm significantly increase cross-fra me co-visibility relationships compared to its correspondin g visual-inertial odometry (VIO) algorithm, improving state estima tion accuracy. The proposed methodolog y sheds fresh insights into both the updated platform design and decoupled approach of active SLAM systems in complex environments. N recent years, visual odometry (VO) and visual-inertial odometry (VIO) have made significant advancements. This work was supported in part by the Beijing Natural Scienc e Foundation under Grant 2022MQ05, in part by the CIE-Tencent Robotics X R hino-Bird Focused Research Program under Grant 2022-07, and in part by the National Natural Science Foundation of China under Grant 62203015, G rant 62303020, Grant 62303021, and Grant 62273351. Zhanhua Xin, Zhihao Wang, Shihan Kong, Y an Xiong, and Junzhi Y u are with the State Key Laboratory for Turbulence and Complex Systems, Department of Advanced Manufacturing and Robotics, C ollege of Engineering, Peking University, Beijing 100871, China (email: xinzhan-hua@stu.pku.edu.cn;

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TII.2025.3567391

2506.131

Country: Asia > China > Beijing > Beijing (0.46)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Interpretable Clustering Ensemble

Lv, Hang, Hu, Lianyu, Jiang, Mudi, Liu, Xinying, He, Zengyou

arXiv.org Artificial IntelligenceJun-9-2025

--Clustering ensemble has emerged as an important research topic in the field of machine learning. Although numerous methods have been proposed to improve clustering quality, most existing approaches overlook the need for interpretability in high-stakes applications. In domains such as medical diagnosis and financial risk assessment, algorithms must not only be accurate but also interpretable to ensure transparent and trustworthy decision-making. Therefore, to fill the gap of lack of interpretable algorithms in the field of clustering ensemble, we propose the first interpretable clustering ensemble algorithm in the literature. By treating base partitions as categorical variables, our method constructs a decision tree in the original feature space and use the statistical association test to guide the tree building process. Experimental results demonstrate that our algorithm achieves comparable performance to state-of-the-art (SOT A) clustering ensemble methods while maintaining an additional feature of interpretability. T o the best of our knowledge, this is the first interpretable algorithm specifically designed for clustering ensemble, offering a new perspective for future research in interpretable clustering. LUSTERING analysis [1] is an unsupervised learning issue in the field of data mining, which aims to partition data into different clusters by exploring its intrinsic structure.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2506.05877

Country: Asia > China (0.69)

Genre: Research Report > New Finding (0.34)

Industry:

Education (0.89)
Health & Medicine (0.89)
Banking & Finance (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken

Inami, Koki, Konosu, Masashi, Yamane, Koki, Masuya, Nozomu, Li, Yunhan, Shu, Yu-Han, Sato, Hiroshi, Homma, Shinnosuke, Sakaino, Sho

arXiv.org Artificial IntelligenceMay-1-2025

FULL PAPER Motion Generation for Food Topping Challenge 2024: Serving Salmon Roe Bowl and Picking Fried Chicken Koki Inami a, Masashi Konosu a, Koki Yamane a, Nozomu Masuya a, Yunhan Li a, Yu-Han Shu a, Hiroshi Sato a, Shinnosuke Homma a, and Sho Sakaino b a Intelligent and Mechanical Interaction Systems, Degree Programs in Systems and Information and Engineering, Graduate School of Science and Technology, University of Tsukuba, Japan; b Department of Intelligent Interaction Technologies, Institute of Systems and Information Engineering, University of Tsukuba, Japan; ARTICLE HISTORY Compiled May 1, 2025 ABSTRACT Although robots have been introduced in many industries, food production robots are yet to be widely employed because the food industry requires not only delicate movements to handle food but also complex movements that adapt to the environment. Force control is important for handling delicate objects such as food. In addition, achieving complex movements is possible by making robot motions based on human teachings. Four-channel bilateral control is proposed, which enables the simultaneous teaching of position and force information. Moreover, methods have been developed to reproduce motions obtained through human teachings and generate adaptive motions using learning. We demonstrated the effectiveness of these methods for food handling tasks in the Food Topping Challenge at the 2024 IEEE International Conference on Robotics and Automation (ICRA 2024). For the task of serving salmon roe on rice, we achieved the best performance because of the high reproducibility and quick motion of the proposed method. Further, for the task of picking fried chicken, we successfully picked the most pieces of fried chicken among all participating teams. This paper describes the implementation and performance of these methods. KEYWORDS Bilateral control; motion-copying system; imitation learning; 1. Introduction Robot automation has progressed in recent years, especially in the manufacturing industry because robots excel at precise repetitive movements in a well-equipped environment. However, their use in the food industry has been limited because of the complexity of the work and the need to generate movements that adapt to the work environment.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2504.19498

Country: Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.46)

Genre: Research Report (0.50)

Industry:

Education > Educational Setting > Higher Education (0.55)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.54)
Food & Agriculture > Agriculture (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Flying through cluttered and dynamic environments with LiDAR

Wu, Huajie, Liu, Wenyi, Ren, Yunfan, Liu, Zheng, Wei, Hairuo, Zhu, Fangcheng, Li, Haotian, Zhang, Fu

arXiv.org Artificial IntelligenceApr-25-2025

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS Flying through cluttered and dynamic environments with LiDAR Huajie Wu, Wenyi Liu, Y unfan Ren, Zheng Liu, Hairuo Wei, Fangcheng Zhu, Haotian Li, and Fu Zhang Abstract --Navigating unmanned aerial vehicles (UAVs) through cluttered and dynamic environments remains a significant challenge, particularly when dealing with fast-moving or sudden-appearing obstacles. This paper introduces a complete LiDAR-based system designed to enable UAVs to avoid various moving obstacles in complex environments. Benefiting the high computational efficiency of perception and planning, the system can operate in real time using onboard computing resources with low latency. For dynamic environment perception, we have integrated our previous work, M-detector, into the system. M-detector ensures that moving objects of different sizes, colors, and types are reliably detected. For dynamic environment planning, we incorporate dynamic object predictions into the integrated planning and control (IPC) framework, namely DynIPC. This integration allows the UAV to utilize predictions about dynamic obstacles to effectively evade them. We validate our proposed system through both simulations and real-world experiments. In simulation tests, our system outperforms state-of-the-art baselines across several metrics, including success rate, time consumption, average flight time, and maximum velocity. Index Terms --LiDAR-based UAV, dynamic obstacle avoidance, cluttered and dynamic environment I. I NTRODUCTION I N recent years, the development of lightweight and high-precision sensors, such as Light Detection and Ranging sensors (LiDAR), event cameras, and depth cameras, has significantly advanced the autonomous flight capabilities of unmanned aerial vehicles (UA Vs) or drones. This technological progress has facilitated the widespread application of drones across various industries, including agricultural spraying [1], product delivery [2], inspection [3], and search and rescue [4]. These applications have notably enhanced production efficiency, reduced costs, and driven economic growth within these sectors.

artificial intelligence, dynamic obstacle, obstacle, (16 more...)

arXiv.org Artificial Intelligence

2504.17569

Country:

Asia > China (0.98)
North America > United States > California (0.28)

Genre: Research Report (0.64)

Industry: Information Technology > Robotics & Automation (0.74)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Vision (0.87)

Add feedback